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(54) Deriving process models for workflow management systems from audit trails 

(57) The present invention relates to the area of 
workflow management systems (WFMS). More particu- 
larly the invention is related to a methodology of auto- 
matically deriving and steadily improving a process 
model executed by the WFMS. 

Current methodologies of defining process models 
are a priori in nature, i.e. it is assumed that the correct 
process model of a business process to be supported 
must be specified before workflow technology can be 
used to execute and control the underlying activities 
making up the business process. 

The current invention dramatically simplifies and 
automates the process of modeling a business model of 
a business process. The invention allows to start just 
with the set of unrelated activities and discover the real 
world relations between them at a later point in time: 
data mining and OLAP technologies are exploited for 
this discovery. The current invention thus proposes a 
posteriori methodology. For that purpose the precise 
underlying process model is derived at a later point in 
time based on audit data collected by the WFMS during 
the early deployment of a process model. 
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Description 

1 Background of the Invention 

1.1 Field of the Invention 

[0001] The present invention relates to the field of 
computer systems acting as workflow management 
systems (WFMS). 

1.2 Description and Disadvantages of Prior Art 

[0002] A new area of technology with increasing 
importance is the domain of Workf low-Management- 
Systems (WFMS). WFMS support the modelling and 
execution of business processes. Business processes 
control which piece of work of a network of pieces of 
work will be performed by whom and which resources 
are exploited for this work, i.e. a business process 
describes how an enterprise will achieve its business 
goals. The individual pieces of work might be distributed 
across a multitude of different computer systems con- 
nected by some type of network 
[0003] The process of designing, developing and 
manufacturing a new product and the process of chang- 
ing or adapting an existing product presents many chal- 
lenges to product managers and engineers to bring the 
product to market for the feast cost and within schedule 
while maintaining or even increasing product quality. 
Many companies are realizing that the conventional 
product design process is not satisfactory to meet these 
needs. They require early involvement of manufacturing 
engineering, cost engineering, logistic planning, pro- 
curement, manufacturing, service and support with the 
design effort. Furthermore, they require planning and 
control of product data through design, release, and 
manufacturing. 

[0004] The correct and efficient execution of business 
processes within a company, e. g. development or pro- 
duction processes, is of enormous importance for a 
company and has significant influence on company's 
overall success in the market place. Therefore, those 
processes have to be regarded similar as technology 
processes and have to be tested, optimized and moni- 
tored. The management of such processes is usually 
performed and supported by a computer based process 
or workflow management system. 
[0005] In D. J. Spoon: "Project Management Environ- 
ment", IBM Technical Disclosure Bulletin, Vol. 32, No. 
9A, February 1990, pages 250 to 254, a process man- 
agement environment is described including an operat- 
ing environment, data elements, and application 
functions and processes. 

[0006] In R. T. Marshak: "IBM's FlowMark, Object-Ori- 
ented Workflow for Mission-Critical Applications", Work- 
group Computing Report (USA), Vol. 17, No. 5, 1994, 
page 3 to 13, the object character of IBM FlowMark as 
a client/server product built on a true object model that 



is targeted for mission-critical production process appli- 
cation development and deployment is described. 
[0007] In H. A. Inniss and J. H. Sheridan: "Workflow 
Management Based on an Object-Oriented Paradigm", 

5 IBM Technical Disclosure Bulletin, Vol. 37, No. 3, March 
1994, page 185, other aspects of object-oriented mod- 
elling on customization and changes are described. 
[0008] In F. Leymann and D. Roller: "Business Proc- 
ess Management with FlowMark", Digest of papers, 

10 Cat. No. 94CH3414-0, Spring COMPCON 94, 1994, 
pages 230 to 234, the state-of-the-art computer process 
management tool IBM FlowMark is described. The 
meta model of IBM FlowMark is presented as well as 
the implementation of IBM FlowMark. The possibilities 

15 of IBM FlowMark for modelling of business processes 
as well as their execution are discussed. The product 
IBM FlowMark is available for different computer plat- 
forms and documentation for IBM FlowMark is available 
in every IBM branch. 

20 [0009] In F. Leymann: "A meta model to support the 
modelling and execution of processes", Proceedings of 
the 1 1th European Meeting on Cybernetics and System 
Research EMCR92, Vienna, Austria. April 21 to 24, 
1992, World Scientific 1992, pages 287 to 294, a meta 

25 model for controlling business processes is presented 
and discussed in detail. 

[0010] The "IBM FlowMark for OS/2", document 
number GH 19-8215-01, IBM Corporation, 1994, availa- 
ble in every IBM sales office, represents a typical mod- 

30 em, sophisticated, and powerful workflow management 
system. It supports the modelling of business proc- 
esses as a network of activities; refer for instance to 
"Modeling Workflow", document number SH 19-8241, 
IBM Corporation, 1996. This network of activities, the 

35 process model, is constructed as a directed, acyclic, 
weighted, colored graph. The nodes of the graph repre- 
sent the activities or workitems which are performed. 
The edges of the graph, the control connectors, 
describe the potential sequence of execution of the 

40 activities. Definition of the process graph is via the IBM 
FlowMark Definition Language (FDL) or the built-in 
graphical editor. The runtime component of the work- 
flow manager interprets the process graph and distrib- 
utes the execution of activities to the right person at the 

45 right place, e. g. by assigning tasks to a work list accord- 
ing to the respective person, wherein said work list is 
stored as digital data within said workflow or process 
management computer system. 
[0011] In F. Leymann and W. Altenhuber: "Managing 

so business processes as an information resource", IBM 
Systems Journal, Vol. 32(2). 1994, the mathematical 
theory underlying the IBM FlowMark product is 
described. 

[001 2] In D. Roller: "Verif ikation von Workflows in IBM 
55 FlowMark", in J. Becker und Q. Vossen (Hrsg.): 
"Geschaeftsprozessmodellierung und Workflows", 
International Thompson Publishing, 1995, the require- 
ment and possibility of the verification of workflows is 
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described. Furthermore the feature of graphical anima- 
tion for verification of the process logic is presented as 
it is implemented within the IBM FlowMark product. 
[0013] For implementing a computer based process 
management system, firstly the business processes s 
have to be analyzed and, as the result of this analysis, a 
process model has to be constructed as a network of 
activities corresponding to the business process. In the 
IBM FlowMark product, the process models are not 
transformed into an executable. At run time, an instance 10 
of the process is created from the process model, called 
a process instance. This process instance is then inter- 
preted dynamically by the IBM FlowMark product. 
[001 4] Another area of the technology relates to data 
mining technology. Data mining technology allows to is 
automatically detect patterns in huge amounts of data. 
Especially, it is not based on rteratively verifying hypoth- 
esises about potential patterns within the data (which is 
intractable, in general) but it is deriving such patterns 
based on much more efficient analysis algorithms. 20 
[0015] OLAP (OnLine Analytical Processing) tools 
allow the fast access to large amounts of data, many dif- 
ferent representations and aggregations of the same 
results sets, and the access of data related to already 
derived results. While this is in theory true for standard 25 
relational DBMSs new technologies like multi-dimen- 
sional hypercubes or time-series allow this in a very effi- 
cient way. 

[001 6] So far no approach has been made to exploit 
data mining and OLAP technologies for WFMS. 30 
[001 7] Moreover the various tasks and activities being 
part of a business process have been developed inde- 
pendently and do not know of their mutual existence. 
Very often it depends on the knowledge of the users of 
these pieces of software in which sequence and under 35 
which conditions the individual activities have to exe- 
cuted. One goal of business reengineering is to deter- 
mine the models of existing business processes. The 
business processes are quite often only known by the 
people working on the various pieces of the business 40 
processes, i.e. various people have local knowledge" of 
the business processes they are working on. Thus often 
there is no single person in an enterprise knowing the 
global interdependence and sequence of execution of 
the individual pieces. Different business reengineering 45 
methodologies are used today to capture this local 
knowledge and to derive the "global model" of the over- 
all business process out of it. But in essence all these 
approaches actually only help to collect the global busi- 
ness process information and help to represent the so 
information in certain ways. Finally all that information 
has to be gathered by interviews from the users of these 
software pieces. 

[0018] The derivation of process models is typically 
addressed by business reengineering tools through 55 
support of a graphical user interface for collecting proc- 
ess information and drawing graphical representations 
of process models, it is the responsibility of the tool user 



to obtain all the information to come up with the process 
model. 

[0019] In many situations process information is not 
complete, if available at all. The quality of the deduced 
process model is highly dependent on the quality of the 
data supplied by the people. Often, this turns out to be 
an obstruction of the successful deployment of workflow 
technology or of its deployment at all. 
[0020] The inherent problem is that current methodol- 
ogies of business reengineering are a priori in nature, 
i.e. it is assumed that the correct process model of a 
business process to be supported must be specified 
before workflow technology can be used. 

1.3 Objective of the Invention 

[0021] The invention is based on the objective to 
improve the method of defining process models for 
workflow management systems. 

2 Summary and Advantages of the Invention 

[0022] The objective of the invention is solved by claim 
1. The current invention teaches a computerized 
method automatically adapting a process model. Said 
process model may be a model of a business process. 
Such process models are used for instance within com- 
puter systems for controlling and executing the underly- 
ing business process. It may comprise a multitude of 
activities being part of said business process. A process 
model defines all possible allowed sequences of activi- 
ties of said business-process. The proposed method of 
adapting is characterized by a pattern collection step. 
The pattern collection step is based on a current proc- 
ess model. During said pattern collection step each 
activity of the current process model which is executed 
is recorded within an audit trail by audit trail records. 
Such an audit trail record encompasses a process 
model instance identification of the process model 
instance within said activity is being executed. In addi- 
tion said audit trail record encompasses an activity iden- 
tification of the activity being executed. Moreover said 
audit trail record encompasses the execution interval of 
said activity comprising a starting time and a termina- 
tion time of said activity being executed. The proposed 
method of adapting is further characterized by a pattern 
analysis step which analyzes after a predefined sam- 
pling period said audit trail records and which generates 
a next process model by a first and a second step. In the 
first step the method determines, based on the current- 
process-model, those activities within the audit trail 
records having identical process model instance identi- 
fications which are processed sequentially and those 
activities which are processed in parallel. Activities exe- 
cuted sequentially are recorded with non-overlapping 
execution-intervals, and activities, which are processed 
in parallel, being recorded with overlapping execution- 
intervals. In the second step the method uses the cur- 
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rent process model to generate a next process model by 
re-ordering those activities, allowed to be processed in 
parallel according to the current process model but 
actually have been executed sequentially only, as deter- 
mined in said first step, into sequential execution order 
based on the time order of said execution intervals. 
[0023] The technique proposed by the current inven- 
tion opens a new type of approach for defining a proc- 
ess model. While the current approaches to define a 
process model are 'a priori* approaches, which means 
that a process model has to be defined externally and 
has to model a business process completely and per- 
fectly. The existence of such a process model is the pre- 
requisite for its execution. The current approach in 
contrast is an 'a posteriori' approach, which means that 
it is possible to start already with a rough approximation 
of a process model and the proposed method is able to 
permanently improve the process model. In an evolu- 
tionary process the correct process model will be deter- 
mined by the proposed method. 
[0024] Typically the various tasks and activities being 
part of a business process have been developed inde- 
pendently and do not know of their mutual existence. 
Very often it depends on the knowledge of the users of 
these pieces of software in which sequence and under 
which conditions the individual activities have to exe- 
cuted. The business processes are quite often only 
known by the people working on the various pieces of 
the business processes, i.e. various people have local 
knowledge" of the business processes they are working 
on. Thus often there is no single person in an enterprise 
knowing the global interdependence and sequence of 
execution of the individual pieces. In such case the pro- 
posed methodology nevertheless will determine the cor- 
rect process model. 

[0025] In many situations process information 
required to define in an *a priori' approach is not com- 
plete, if available at all. The quality of the deduced proc- 
ess model is highly dependent on the quality of the data 
supplied by the people. Often, this turns out to be an 
obstruction of the successful deployment of workflow 
technology or of its deployment at all. Due to the evolu- 
tionary approach of the current teaching incomplete 
business process information is no longer an obstacle. 
[0026] As the current invention proposes a posteriori 
methodology deriving the precise underlying process 
model at a later point in time based on audit data col- 
lected during the deployment of the process model 
finally high quality models for the business processes 
will be achieved. Moreover even if the underlying busi- 
ness process has to change, for instance due to rea- 
sons inherent to the business itself and not related to its 
computer representation, the current invention will be 
able to adapt the process model to these changes. 
[0027] Additional advantages of the invention are 
accomplished by claim 2. 

According this teaching the computerized method of 
automatically adapting a process model is repeated 



permanently by using said next process model (gener- 
ated by the previous iteration) as the current process 
model, being the basis for a repeated iteration of the 
methodology. 

5 [0028] Such an approach provides the full flexibility of 
the current teaching. Based on the teaching of claim 2 
the methodology may be repeated permanently this will 
guarantee to finally generate the authentic representa- 
tion of the business process as a process model. Also 

10 due to this strategy the current teaching will also be able 
to adapt a process model if the actual business process 
has changed due to external reasons. 
[0029] Further additional advantages of the invention 
are accomplished by claim 3. 

15 Claim 3 teaches that a pattern analysis step which 
never re-orders two activities in a sequential execution 
order if said two activities have been determined at least 
once as being processed in parallel. 
[0030] The advantages of this approach is that the 

20 current teaching becomes stable against accidental 
serial execution of activities. Any serialization within a 
process model represents a further restriction of the 
process model. The teaching of claim 3 allows the cur- 
rent teaching to distinguish sequential execution of 

25 activities due to random influences from sequential exe- 
cution of activities due to the inherent nature of the busi- 
ness process. 

[0031] Additional advantages of the invention are 
accomplished by claim 4. 

30 [0032] According this teaching said current process 
model and said next process model are managed and 
executed by a Workf low-Management-System (WFMS). 
[0033] According this teaching the complete capabili- 
ties of the current teaching is made available to WFMS. 

35 [0034] Further additional advantages of the invention 
are accomplished by claim 5. 
Claim 5 teaches that the said pattern collection step is 
executed by said WFMS. 

[0035] According to this teaching the capabilities, 
40 which are already available within a WFMS, are 
exploited for the current method of automatically adapt- 
ing a process model. 

[0036] Further additional advantages of the invention 
are accomplished by claim 6. 
45 Claim 6 teaches to start the said repeated application of 
the methodology with an initial current process model 
which allows for parallel execution of all activities within 
said initial 

current process model. 

so [0037] Thus the current teaching suggests to start 
with a very simple initial current process model. It sim- 
ple consists of the unrelated collection of activities not 
imposing any initial precedence between the individual 
activities. Moreover this initial current process model, 

55 due to its generality and simplicity, can also be gener- 
ated automatically. 
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3 Brief Description of the Drawings 
[0038] 

Figure 1 is reflecting one iteration step of the current s 
invention starting with the initial process 
model to derive a first and improved next 
process model. 

Figure 2 visualizes a simple example of an audit trail 

encompassing a multitude of audit trail 10 
records. 

Figure 3 depicts the precedence relations derived 
from the audit trail records visualized in the 
example of Figure 2. 

Figure 4 visualizes the resulting process model of 15 
the current example if the individual prece- 
dence relations are combined to form the 
global picture. 



4 Description of the Preferred Embodiment 20 

[0039] The current invention is illustrated based on 
IBM's FlowMark workflow management system. Of 
course any other WFMS could be used instead. Fur- 
thermore the current teaching applies also to any other 25 
type of system which offers WFMS functionalities not as 
a separate WFMS but within some other type of system. 
[0040] It has to be stressed especially that using a 
WFMS for producing the audit log for applying OLAP or 
data mining technology is not vital for the current inven- 30 
tion. Other techniques like an appropriate wrappering or 
instrumentalization of existing applications might me 
used for generating the required log records. But using 
a WFMS for this purpose is convenient in many situa- 
tions. Thus, without limiting the scope of the current 35 
invention, the exploitation of a WFMS for what follows is 
assumed. 

4.1 Introduction 

40 

[0041] The following is a short outline on the basic 
concepts of a workflow management system based on 
IBM's FlowMark WFMS: 

[0042] From an enterprise point of view the manage- 
ment of business processes is becoming increasingly 45 
important: business processes or process for short 
control which piece of work will be performed by whom 
and which resources are exploited for this work, i.e. a 
business process describes how an enterprise will 
achieve its business goals. A WFMS may support both, so 
the modeling of business processes and their execu- 
tion. 

[0043] Modeling of a business process as a syntacti- 
cal unit in a way that is directly supported by a software 
system is extremely desirable. Moreover, the software 55 
system can also work as an interpreter basically getting 
as input such a model: The model, called a process 
model or workflow model, can then be instantiated 



and the individual sequence of work steps depending 
on the context of the instantiation of the model can be 
determined. Such a model of a business process can be 
perceived as a template for a class of similar processes 
performed within an enterprise; it is a schema describ- 
ing all possible execution variants of a particular kind of 
business process. An instance of such a model and its 
interpretation represents an individual process, i.e. a 
concrete, context dependent execution of a variant pre- 
scribed by the model. A WFMSs facilitates the manage- 
ment of business processes. It provides a means to 
describe models of business processes (build time) and 
it drives business processes based on an associated 
model (run time). The meta model of IBM's WFMS 
FlowMark, i.e. the syntactical elements provided for 
describing business process models, and the meaning 
and interpretation of these syntactical elements, is 
described next. 

[0044] A process model is a complete representation 
of a process, comprising a process diagram and the set- 
tings that define the logic behind the components of the 
diagram. Using various services provided by FlowMark 
these buildtime definitions the process models are then 
converted into process templates for use by FlowMark 
at runtime. Important components of a FlowMark proc- 
ess model are: 

■ Processes 

■ Activities 

■ Blocks 

■ Control Flows 

■ Connectors 

■ Data Containers 

■ Data Structures 

■ Conditions 

■ Programs 

■ Staff 

Not all of these elements will be described below. 
[0045] On this background a process, modeled by a 
process model within FlowMark, is a sequence of activ- 
ities that must be completed to accomplish a task. The 
process is the top-level element of a FlowMark workflow 
model. In a FlowMark process, it can be defined: 

• How work is to progress from one activity to the 
next 

• Which persons are to perform activities and what 
programs they are to use 

• Whether any other processes, called subproc- 
esses, are nested in the process 

Of course multiple instances of a FlowMark process can 
run in parallel. 

[0046] Activities are the fundamental elements of the 
meta model. An activity represents a business action 
that is from a certain perspective a semantical entity of 
its own. With the model of the business process it might 
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have a fine-structure that is then represented in turn via 
a model, or the details of it are not of interest at all from 
a business process modeling point of view. Refinement 
of activities via process models allows for both, mode- 
ling business processes bottom-up and top-down. 5 
Activities being a step within a process represents a 
piece of work that the assigned person can complete by 
starting a program or another process. In a process 
model, the following information is associated with each 
activity: 

• What conditions must be met before the activity can 
start 

• Whether the activity must be started manually by a 
user or can start automatically 

• What condition indicates that the activity is com- 
plete 

• Whether control can exit from the activity automati- 
cally or the activity must first be confirmed as com- 
plete by a user 

• How much time is allowed for completion of the 
activity 

• Who is responsible for completing the activity 

• Which program or process is used to complete the 
activity 

• What data is required as input to the activity and as 
output from it 

A FlowMark process model consists of the following 
types of activities: 

Program activity: Has a program assigned to perform 
it. The program is invoked when the activity is started. In 
a fully automated workflow, the program performs the 
activity without human intervention. Otherwise, the user 
must start the activity by selecting it from a runtime work 
list. Output from the program can be used in the exit 
condition for the program activity and for the transition 
conditions to other activities. 
Process activity: Has a (sub-)process assigned to per- 
form it. The process is invoked when the activity is 
started. A process activity represents a way to reuse a 
set of activities that are common to different processes. 
Output from the process, can be used in the exit condi- 
tion for the process activity and for the transition condi- 
tions to other activities. 

[0047] The flow of control, i.e. the control flow 
through a running process determines the sequence in 
which activities are executed. The FlowMark workflow 
manager navigates a path through the process that is 
determined by the evaluation to true of start conditions, 
exit conditions, and transition conditions. 
[0048] The results that are in general produced by the 
work represented by an activity is put into an output 
container, which is associated with each activity. Since 
an activity will in general require to access output con- 
tainers of other activities, each activity is associated in 
addition with an input container too. At run time, the 
actual values for the formal parameters building the 



input container of an activity represent the actual con- 
text of an instance of the activity. Each data container is 
defined by a data structure. A data structure is an 
ordered list of variables, called members, that have a 
name and a data type. Data connectors represent the 
transfer of data from output containers to input contain- 
ers. When a data connector joins an output container 
with an input container, and the data structures of the 
two containers match exactly, the FlowMark workflow 
manager maps the data automatically. 
[0049] Connectors link activities in a process model. 
Using connectors, one defines the sequence of activi- 
ties and the transmission of data between activities. 
Since activities might not be executed arbitrarily they 
are bound together via control connectors. A control 
connector might be perceived as a directed edge 
between two activities; the activity at the connector's 
end point cannot start before the activity at the start 
point of the connector has finished (successfully). Con- 
trol connectors model thus the potential flow of control 
within a business process model. Default connectors 
specify where control should flow when the transition 
condition of no other control connector leaving an activ- 
ity evaluates to true. Default connectors enable the 
workflow model to cope with exceptional events. Data 
connector specify the flow of data in a workflow model. 
A data connector originates from an activity or a block, 
and has an activity or a block as its target. One can 
specify that output data is to go to one target or to mul- 
tiple targets. A target can have more than one incoming 
data connector. 

[0050] Conditions are the means by which it is possi- 
ble to specify the flow of control in a process. In Flow- 
Mark process models logical expressions can be 
defined that are evaluated by FlowMark at runtime to 
determine when an activity may start, end, and pass 
control to the next activity. Start conditions are condi- 
tions that determine when an activity with incoming con- 
trol connectors can start. The start condition may 
specify that all incoming control connectors must evalu- 
ate to true, or it may specify that at least one of them 
must evaluate to true. Whatever the start condition, all 
incoming connectors must be evaluated before the 
activity can start. If an activity has no incoming control 
connectors, it becomes ready when the process or 
block containing it starts. In addition, a Boolean expres- 
sion called transition condition is associated with 
each control connector. Parameters from output con- 
tainers of activities having already produced their 
results are followed as parameters referenced in transi- 
tion conditions. When at run time an activity terminates 
successfully alt control connectors leaving this activity 
are determined and the truth value of the associated 
transition conditions is computed based on the actual 
values of their parameters. Only the end points of con- 
trol connectors the transition conditions of which evalu- 
ated to TRUE are considered as activities that might be 
executed based on the actual context of the business 
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process. Transition conditions model thus the context 
dependent actual flow of control within a business proc- 
ess (i.e. an instance of a model). Business processes 
encompass long running activities in general; such an 
activity need to be allowed to become interrupted. Thus, 
termination of an activity does not necessarily indicate 
that the associated task has been finished successfully. 
In order to allow the measurement of successfullness of 
the work performed by an activity a Boolean expression 
called exit condition is associated with each activity. 
Exactly the activities the exit condition of which evalu- 
ated to true in the actual context are treated as success- 
fully terminated. For determination of the actual control 
flow precisely the successfully terminated activities are 
considered. Thus the logical expression of an exit con- 
dition, if specified, must evaluate to true for control to 
pass from an activity or block. 
[0051 ] Beside describing the potential flow of control 
and data between activities a business process model 
also encompasses the description of the flow of the 
activities itself between "resources" actually performing 
the pieces of work represented by each activity. A 
resource may be specified as a particular program, per- 
son, a role, or an organizational unit. At run time tasks 
are resolved into requests to particular persons to per- 
form particular activities resulting in workitems for that 
person. Staff assignments are the means to distribute 
activities to the right people in the sequence prescribed 
by the control flow aspect of a business process model. 
Each activity in a process is assigned to one or more 
staff members defined in the FlowMark database. 
Whether an activity is started manually by the user or 
automatically by the FlowMark workflow manager, and 
whether it requires user interaction to complete or com- 
pletes automatically, a staff member must be assigned 
to it. FlowMark staff definition entails more than identify- 
ing people at your enterprise to the FlowMark database. 
For each person defined, you can specify a level, an 
organization, and multiple roles. These attributes can 
be used at run time to dynamically assign activities to 
people with suitable attributes. 
[0052] Process definition includes modeling of activi- 
ties, control connectors between the activities, 
input/output container, and data connectors. A process 
is represented as a directed acyclic graph with the activ- 
ities as nodes and the control/data connectors as the 
edges of the graph. The graph is manipulated via a built- 
in, event-driven, CUA compliant graphic editor. The data 
containers are specified as named data structures. 
These data structures themselves are specified via the 
DataStructu re Definition facility. FlowMark distinguishes 
three main types of activities: program activities, proc- 
ess activities, and blocks. Program activities are imple- 
mented through programs. The programs are registered 
via the Program Definition facility. Blocks contain the 
same constructs as processes, such as activities, con- 
trol connectors etc. They are however not named and 
have their own exit condition. If the exit condition is not 



met, the block is started again. The block thus imple- 
ments a Do Until construct. Process activities are imple- 
mented as processes. These subprocesses are defined 
separately as regular, named processes with all its 

s usual properties. Process activities offer great flexibility 
for process definition. It not only allows to construct a 
process through permanent refinement of activities into 
program and process activities (top-down), but also to 
build a process out of a set of existing processes (bot- 

w tom-up) . In particular, process activities help to organize 
the modeling work if several process modeler are work- 
ing together. It allows the team members to work inde- 
pendently on different activities. Program and process 
activities can be associated with a time limit. The time 

15 limit specifies how long the activity may take. If the time 
is exceeded, a designated person is notified, if this per- 
son does not react within another time limit, the process 
administrator is notified. It not only helps to recognize 
critical situation but also to detect process deficiencies 

20 as all notifications are recorded in an audit trail. 

[0053] AJi data structures used as templates for the 
containers of activities and processes are defined via 
the Data Structure Definition Facility. Data Structures 
are names and are defined in terms of elementary data 

25 types, such as float, integer, or string and references to 
existing data structures. Managing data structures as 
separate entities has the advantage that all interfaces of 
activities and their implementations are managed con- 
sistently in one place (similar to header files in program- 
so ming languages). 

[0054] All programs which implement program activi- 
ties are defined via the Program Registration Facility. 
Registered for each program is the name of the pro- 
gram, its location, and the invocation string. The invoca- 

35 tion string consists of the program name and the 
command string passed to the program. 
[0055] Before process instances can be created, the 
process model must be translated to ensure the correct- 
ness and completeness of the process model. The 

40 translated version of the model is used as a template 
when a process instance is created. This allows to 
make changes to the process model without affecting 
executing process instances. A process instance is 
started either via the graphical interface of via the calla- 

45 ble process application programming interface. When a 
process is started, the start activities are located, the 
proper people are determined, and the activities are 
posted onto the work list of the selected people. If a user 
selects the activity, the activity is executed and removed 

so from the work list of any other user to whom the activity 
has been posted. After an activity has executed, its exit 
condition is evaluated. If not met, the activity is resched- 
uled for execution, otherwise all outgoing control con- 
nectors and the associated transition conditions are 

55 evaluated. A control connector is selected, if the condi- 
tion evaluates to TRUE. The target activities of the 
selected control connectors are then evaluated. If their 
start conditions are true, they are posted to the work list 
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of selected people. A process is considered terminated, 
if all end activities have completed. To make sure that all 
end activities finish, a dead path elimination is per- 
formed. It removes all edges in the process graph which 
can never be reached due to failing transition condi- 
tions. All information about the current state of a proc- 
ess is stored in the database maintained by the server. 
This allows for forward recovery in the case of crashes. 

4.2 The Basic Problem 

[0056] The current invention is addressing the prob- 
lem of modeling business processes. Various tools 
exists to support the specification of models of business 
processes, for example business reengineering tools or 
modeling components of workflow management sys- 
tems (WFMS). Once having imported the model of a 
business process into a WFMS it can be instantiated 
and enacted by the WFMS to guide users in correctly 
performing their business tasks. In general, it is 
assumed that the undertaking of deriving such a model 
produces a description of all the relations between all 
the business tasks (or called activities in the area of 
WFMS) encompassed in the subject business process. 
In this term 'relations between all business tasks' means 
the precedence relation between the tasks or in other 
words the predecessor/successor relationship or the 
potential execution sequences of the tasks. The current 
invention dramatically simplifies and automates the 
process of modeling a business model of a business 
process. The invention allows to start just with the set of 
unrelated activities and discover the real world relations 
between them at a later point in time; data mining and 
OLAP technologies are exploited for this discovery. 
[0057] Typically the various tasks and activities being 
part of a business process have been developed inde- 
pendently and do not know of their mutual existence. 
Very often it depends on the knowledge of the users of 
these pieces of software in which sequence and under 
which conditions the individual activities have to exe- 
cuted. One goal of business reengineering is to deter- 
mine the models of existing business processes. The 
business processes are quite often only known by the 
people working on the various pieces of the business 
processes, i.e. various people have "local knowledge" of 
the business processes they are working on. Thus often 
there is no single person in an enterprise knowing the 
global interdependence and sequence of execution of 
the individual pieces. Different business reengineering 
methodologies are used today to capture this local 
knowledge and to derive the "global model" of the over- 
all business process out of it. But in essence all of these 
approaches actually only help to collect the global busi- 
ness process information and help to represent the 
information in certain ways. Finally all that information 
has to be gathered by interviews from the users of these 
software pieces. 

[0058] The derivation of process models is typically 



addressed by business reengineering tools through 
support of a graphical user interface for collecting proc- 
ess information and drawing graphical representations 
of process models. It is the responsibility of the tool user 
5 to obtain all the information to come up with the process 
model. 

[0059] In many situations process information is not 
complete, if available at all. The quality of the deduced 
process model is highly dependent on the quality of the 

10 data supplied by the people. Often, this turns out to be 
an obstruction of the successful deployment of workflow 
technology or of its deployment at all. 
[0060] The inherent problem is that current methodol- 
ogies of business reengineering are a priori in nature, 

75 i.e. it is assumed that the correct process model of a 
business process to be supported must be specified 
before workflow technology can be used. 
[0061] The current invention proposes a posteriori 
methodology that does not make this assumption, i.e. it 

20 allows to support a business process from the very 
beginning by workflow technology and to derive the pre- 
cise underlying process model at a later point in time 
based on audit data collected by the WFMS during the 
early deployment. Especially, this will help to come up 

25 with high quality models for the business processes to 
be supported. 

4.3 The Solution 

30 [0062] Data mining technology allows to automatically 
detect patterns in huge amounts of data. Especially, it is 
not based on iteratively verifying hypothesises about 
potential patterns within the data (which is intractable, in 
general) but it is deriving such patterns based on much 

35 more efficient analysis algorithms. 

[0063] OLAP (OnLine Analytical Processing) tools 
allow the fast access to large amounts of data, many dif- 
ferent representations and aggregations of the same 
results sets, and the access of data related to already 

40 derived results. While this is in theory true for standard 
relational DBMSs new technologies like multi-dimen- 
sional hypercubes or time-series allow this in a very effi- 
cient way. 

[0064] In general, a WFMS writes for each significant 
45 workflow event a log record into an audit trail. Espe- 
cially, the event that a particular activity has been 
started or terminated and at what time is logged in this 
audit trail. In addition, these events are associated with 
the instance of the process model they are supporting. 
so Consequently, the sequence of execution of the activi- 
ties of a process model are available in the audit log of 
a WFMS on a per instance base. Data mining technol- 
ogy can thus be used to determine patterns of execution 
sequences of activities of business processes from 
55 audit trails. 

[0065] From a global perspective the current invention 
suggests a method of evolutionary discovering and 
adapting a process model of a business process. The 
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methodology is based on an approach in which the 
WFMS automatically is tracking the execution behavior 
of the various activities as executed by the users. Based 
on this tracked audit trail records the WFMS automati- 
cally derives an improved version of the process model. 
This improved process model is then used within the 
next sampling period to derive once more an improved 
process model. The suggested approach will then be 
iterated over and over again to steadily improve and 
finally discover the actual process model. 
[0066] Thus the current invention teaches the follow- 
ing aspects: 

□A methodology using a current process model 
which determines a next process model evolution- 
ary improved and adapted to the business process. 
OA methodology to determine an initial version of 
the current process model. 
OA methodology of iteratively repeating above 
determination step exploiting the adapted process 
model and use it as current process model to auto- 
matically generate once more a next process 
model. 

[0067] While current methodologies of defining proc- 
ess models are a priori in nature, i.e. it is assumed that 
the correct process model of a business process to be 
supported must be specified before workflow technol- 
ogy can be used to execute and control the underlying 
activities making up the business process. 
[0068] The current invention dramatically simplifies 
and automates the process of modeling a business 
model of a business process. The invention allows to 
start just with the set of unrelated activities and discover 
the real world relations between them at a later point in 
time; data mining and OLAP technologies are exploited 
for this disoovery. The current invention thus proposes a 
posteriori methodology. For that purpose the precise 
underlying process model is derived at a later point in 
time based on audit data collected by the WFMS during 
the early deployment of a process model. 

4.3.1 The Initial Process Model 

[0069] The proposed methodology assumes that all 
activities of the process model to be derived are defined 
to the WFMS. For this purpose all programs available to 
support the various activities must be catalogued in the 
WFMS, and all personnel data relevant for the business 
process must be defined. Basically, an activity definition 
then is represented by a pair consisting of a program 
name and a query on the personnel data; The query is 
used to describe all persons having the duty or respon- 
sibility or skill to perform the associated activity, and the 
program specifies the executable supporting the per- 
formance of the activity on a computer. When the 
WFMS detects that a given activity must be performed it 
executes the associated query and all people qualifying 



under that query will get a corresponding notification 
(called a workitem). The associated program will be 
started when a person decides to perform the activity. 
[0070] The proposed methodology of evolutionary dis- 

s covering the model of a business process begins with 
simply enumerating all activities encompassed by this 
business process. This enumeration of activities is the 
first iteration of the model of the particular business 
process, i.e. the model is assumed to be the set of iso- 

10 lated activities ("discrete process model"). The model 
is then set into production, i.e. it can be instantiated as 
a workflow in the WFMS. The operational behavior of 
the WFMS is that it will schedule all activities having no 
predecessor when a process model is instantiated, i.e. 

75 the workitems corresponding to these activities will be 
created. 

[0071] Figure 1 is reflecting one iteration step of the 
current invention starting with the initial process model 
to derive a first and improved next process model. The 
20 initial process model 100 encompasses activities A1 to 
A5 at this point in time not prescribing any sequence for 
their execution. The WFMS 101 is recording the execu- 
tion pattern by writing audit trail records to the audit trail 
103. 

25 

4.3.2 Adapting the Current Process Model 

[0072] Thus, when a process model is instantiated all 
affected users will get all workitems corresponding to all 

30 encompassed activities. Due to their local and peculiar 
knowledge of the underlying business process the 
users will perform the workitems in the appropriate 
sequence. This sequence is recorded automatically 
during execution and is reflected in the WFMS's audit 

35 trail. 

[0073] After an appropriate sampling period of time 
the audit trail will contain the execution sequences of 
many different instances of the business process. This 
data will now be analyzed by data mining algorithms 

40 and OLAP tools so that a set of execution patterns will 
result. The patterns with the appropriate confidence and 
support will be transformed into precedence relations 
between the activities of the process model: The next 
iteration of the process model is created. The former 

45 iteration is withdrawn from production while the new 
process model is put into production. Again, the audit 
trail is analyzed after an appropriate time period to pos- 
sibly adapt the current process model etc.. 
[0074] Figure 1 describes the basic approach of the 

so proposed methodology. Data mining algorithms 1 04 are 
analyzing the audit trial 102 for execution pattern 105 to 
finally automatically generate a next process model 
106. The next process model will then be used as the 
current process model to repeat the methodology again 

55 for further improvement. 

[0075] Figure 2 visualizes a simple example of an 
audit trail encompassing a multitude of audit trail 
records. This example assumes that a business proc- 



9 

02/06/2003, EAST Version: 1.03.0002 



17 EP0895 

ess deals with activities A, B, C, D. Then the first itera- 
tion of the corresponding process model is 
P={A,B,C,D}. The {J-notation indicates a set of activities 
without any precedence relation within the set of activi- 
ties; activities belonging to the same {}-set may be exe- 5 
cuted in parallel. Setting P into production may result in 
the audit trail of Figure 2 generated by the WFMS. For 
simplicity reasons it is further assumed that only one 
process model (the process model P) is executed. Thus 
the 'Process Name* column indicates the execution of w 
this process model P. The column 'Process Instance' 
reflects an indication of the process model instance 
within which the activity identified by the entry in the 
'Activity' column is executed. Finally the column 'Execu- 
tion Interval' contains the execution interval comprising J5 
the starting time and termination time of a particular 
activity execution. 

[0076] Exploiting OLAP tools on the audit trail records 
of the process model of interest the following aggrega- 
tion of precedence relations as reflected in Figure 3 can 20 
be derived. For each process instance of a certain proc- 
ess model and for each activity the first time an activity 
has been executed and the last time it has been termi- 
nated is derived and the corresponding execution inter- 
val is build. Only these records are of interest for the 25 
further proceeding. Next, for each instance it is checked 
which execution intervals intersect: Intersecting inter- 
vals mean parallel executions of the associated activi- 
ties. For all execution intervals of a particular instance 
which do not intersect the sequence of execution inter- 30 
vals determines the precedence relation of the associ- 
ated activities; i.e. the sequence of their execution 
intervals determine in which sequence the activities 
have to be executed. Combining these individual 
sequences to the global view results in the desired next 35 
process model. 

[0077] On the aggregation table of Figure 2 data min- 
ing technology is applied to determine the patterns of 
the precedence relations. The resulting precedence 
relations are reflected in Figure 3. Again the 0-notation 40 
is used. 

[0078] If these patterns are then transformed as 
described above into a global picture the process model 
results. For the current example the corresponding 
process model is visualized in Figure 4. as 
[0079] A further subtlety for the proposed methodol- 
ogy has to be mentioned: A single record in the aggre- 
gation table above which indicates parallel execution of 
activities should overrule all other rows representing 
sequential execution of the activities. Either this must be so 
determined after the mining algorithms have been 
applied or the algorithms have to respect this. 

4.4 Advantages 

55 

[0080] In summary, the current invention consists of 
the following parts: 
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□it exploits data mining technology to detect exe- 
cution patterns of activities based on audit trail 
data; 

□it transform the discovered execution patterns 
into precedence relations between the affected 
activities; 

□it uses a workflow management system to get the 
audit trail required for input of the data mining algo- 
rithms; 

□it discovers process models in an evolutionary 
manner start with the isolated set of activities of the 
subject business process. 

[0081 ] As the fundamental advantage process models 
can be determined in an evolutionary manner. No 
extensive modelling phase has to be performed before 
exploiting workflow technology. Moreover the resulting 
process models might have a higher quality than those 
derived via interviews with people having only local 
knowledge about the business process. While up to the 
current point in time process models have to be defined 
explicitly the current methodology opens the possibility 
to determine the process model automatically by the 
WFMS itself not requiring an extensive modeling phase. 

5 Acronyms 

[0082] 

DB Database 

DBMS Database Management System 
WFMS Workflow Management System 
OLAP OnLine Analytical Processing 

Claims 

1. A computerized method automatically adapting a 
process-model being a model of a business-proc- 
ess 

said process-model comprising a multitude of 
activities being part of said business-process; 

said process-model defining all possible 
allowed sequences of activities of said busi- 
ness-process; 

said method of adapting being characterized 
by 

a pattern-collection-step, said pattern-collec- 
tion-step is based on a current-process-model 
and during said pattern-collection-step each 
activity being executed is recorded within an 
audit-trail by audit-trail-records 

said audit-trail-records encompassing a 
process-model-instance-identification of a 
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process-model-instance within said activity 
is being executed; 

6. 

said audit-trail-records encompassing an 
activity-identification of said activity being s 
executed; and 

said audit-trail-records encompassing an 
execution-interval of said activity compris- 
ing a starting-time and a termination-time 10 
of said activity being executed; 



pattern-collection-step is executed by said WFMS. 

A computerized method automatically adapting a 
process-model according to claim 2, 3, 4 or 5 
wherein said repeated application starts with an ini- 
tial-current-process-model which allows for parallel 
execution of all activities within said initial-current- 
process-model. 



said method of adapting being further charac- 
terized by 

a pattern-analysis-step analyzing after a pre- 
defined sampling period said audit-trail-records 
and generating a next-process-model 



by determining in a first step based on the cur- 20 
rent-process-model from the audit-trail-records 
with identical process-model-instance-identifi- 
cations activities, which are processed sequen- 
tially being recorded with non-overlapping 
execution-intervals, and activities, which are 25 
processed in parallel being recorded with over- 
lapping execution-intervals; and 



by adapting in a second step said current-proc- 
ess-model to generate a next-process-model 30 
by re-ordering those activities, allowed to be 
processed in parallel according to the current- 
process-model but actually have been exe- 
cuted sequentially only, as determined in said 
first step, into sequential execution order based 35 
on the time order of said execution-intervals. 



2. A computerized method automatically adapting a 
process-model according to claim 1 being charac- 
terized by repeated application of said method of 40 
adapting in using said next-process-model as said 
current-process-model. 



3. A computerized method automatically adapting a 
process-model according to any of above claims 45 
wherein said pattern-analysis-step never re-orders 
two activities in a sequential execution order if said 
two activities have been determined at least once 

as being processed in parallel. 

so 

4. A computerized method automatically adapting a 
process-model according to any of above claims 
wherein said current-process-model and said next- 
process-model are managed and executed by a 
Workflow-Management-System (WFMS). 55 

5. A computerized method automatically adapting a 
process-model according to claim 4 wherein said 
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